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This paper investigates optimal portfolio strategies in a market where the drift is driven 
by an unobserved Markov chain. Information on the state of this chain is obtained from 
stock prices and expert opinions in the form of signals at random discrete time points. As 
in Frey et al. (2012), Int. J. Theor. Appl. Finance, 15, No. 1, we use stochastic filtering 
to transform the original problem into an optimization problem under full information 
where the state variable is the filter for the Markov chain. This problem is studied 
with dynamic programming techniques and with regularization arguments. Using results 
from the recent literature we obtain the existence of classical solutions to the dynamic 
programming equation in a regularized version of the model. From this the optimal 
strategy in the regularized model is straightforward to compute. We give convergence 
results which show that this strategy is e-optimal in the original model. 

Keywords: Portfolio optimization, hidden Markov model, dynamic programming equa- 
tion, regularization, e-optimal strategy 



1. Introduction 

It is well-known that optimal investment strategies in dynamic portfolio optimiza- 
tion depend crucially on the drift of the underlying asset price process. On the 
other hand it is notoriously difficult to estimate drift parameters from historical 
asset price data. Hence it is natural to include expert opinions or investors' views 
as additional source of information in the computation of optimal portfolios. In the 
context of the classical one-period Markowitz model this leads to the well-known 
Black-Littermann approach, where Bayesian updating is used to improve return 
predictions (see Black <fc Litterman (1992)). 

Frey et al. (2012) consider expert opinions in the context of a dynamic portfolio 
optimization problem in continuous time. In their paper the asset price process is 
modelled as diffusion whose drift is driven by a hidden finite-state Markov chain 
Y. Investors observe the stock prices and in addition a marked point process with 
jump-size distribution depending on the current state of Y that represents expert 
opinions. Frey et al. (2012) derive a finite-dimensional filter p t for the state of Y 
and they reduce the portfolio optimization problem to a problem under complete 
information with state variable given by the filter p t . Moreover they write down 
the dynamic programming equation for the value function V of that problem and 
they give an expression for the optimal strategy in terms of V and its derivatives. 
The precise mathematical interpretation of the dynamic programming equation is 
however left open; in particular it is not clear if a classical solution exists and thus 
if the formula for the optimal strategy is mathematically meaningful. 
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These issues are addressed in the present paper. The main difficulty in establish- 
ing the existence of a classical solution is the fact that the dynamic programming 
equation is not strictly elliptic if the number of states of Y is larger than the num- 
ber of assets. In order to overcome this problem we use a rcgularization argument 
where an additional noise term of the form m~^dBt 1 B an independent Brownian 
motion of suitable dimension and m € N large, is added to the dynamics of the 
state process p. The dynamic programming equation associated with the regular- 
ized optimization problem is strictly elliptical so that a recent results of Davis & 
Lleo (2011) imply the existence of a classical solution V m . Moreover, the optimal 
strategy for the regularized problem exists and is of the same form as the candi- 
date optimal strategy for the original problem (but of course with V m instead of 
V). Adapting arguments from Pham (1998) we show that for m — > oo reward- and 
value function for the regularized problem and the original problem converge. This 
implies that for m sufficiently large the optimal strategy for the regularized prob- 
lem is a nearly-optimal strategy in the original problem so that we have solved the 
problem of finding good strategies, at least theoretically. 

The related literature on portfolio optimization under partial information is dis- 
cussed in detail in the companion paper Frey et al. (2012). Here we just mention the 
papers Ricder & Bauerle (2005) and Sass & Haussmann (2004) that are concerned 
with portfolio optimization in models with Markov-modulated drift but without 
any extra information. 

The paper is organized as follows. In Section [5] we introduce the model of the 
financial market and formulate the portfolio optimization problem. For this problem 
we derive in Section [3] the dynamic programming equation in the case of power 
utility. In Section [4] we reformulate the state equation in terms of an exogencous 
Poisson random measure. For this reformulated state equation we provide in Section 
[5] an explicit construction of the jump coefficient. The main results of this paper 
are presented in Section [6] Here we study a regularized version of the dynamic 
programming equation and investigate e-optimal strategies. 

2. Model and Optimization Problem 

The analytical setting is based on that proposed by Frey et al. (2012). For a fixed 
date T > representing the investment horizon, we work on a filtered probability 
space (Q, Q, G, P), with filtration G = [Qt)te[o,T] satisfying the usual conditions. 
All processes are assumed to be G-adapted. For a generic G-adapted process H we 
denote by G H the filtration generated by H . 

Price dynamics. We consider a market model for one risk-free bond with price 
= 1 and n risky securities with prices St = {S\ , . . . , S™) T given by 

n 

dS\ = S l t (v l (Yt)dt + J2 <T ij dW}\ , S l Q = s\ i = 1, ■ ■ ■ , n. (2.1) 

3=1 

Here fj, = fJ.(Y t ) € R n denotes the mean stock return or drift which is driven by 
some factor process Y described below. The volatility a = {^)\<i,j<n is assumed 
to be a constant invertible matrix and Wt = (W} , • • • -W™) is an n-dimensional G- 
adapted Brownian motion. The invertibility of a always can be ensured by a suitable 
parametrization if the covariance matrix aa^ is positive definite. The factor process 
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Y is a finite-state Markov chain independent of the Brownian motion W with state 
space {ei, . . . , e^} where e^ is the ith unit vector in M. d . The generator matrix is 
denoted by Q and the initial distribution by p = (p 1 , . . . ,p d ) T . The states of the 
factor process Y are mapped onto the states /ii , . . . , ji^ of the drift by the function 
fi(Y t ) = MY t , where M lk = fi' k = //(eft), 1 < I < n, 1 < k < d. 

Define the return process R associated with the price process S by dR\ — dS' l t / S\, 
i = 1, . . . , n. Note that R satisfies dR t = fi(Y t )dt + adWt, and it is easily seen that 
G R = G log s = G s . This is useful, since it allows us to work with R instead of S in 
the filtering part. For details we refer to Frey et al. (2012). 

Investor Information. We assume that the investor does not observe the factor 
process Y directly; he does however know the model parameters, in particular the 
initial distribution p, the generator matrix Q and the functions £t l (-). Moreover, 
he has noisy observations of the hidden process Y at his disposal. More precisely 
we assume that the investor observes the return process R and that he receives at 
discrete points in time T n noisy signals about the current state of Y. These signals 
are to be interpreted as expert opinions; specific examples can be found in the 
companion paper Frey et al. (2012). 

We model expert opinions by a marked point process / = (T„, Z n ), so that at T n 
the investor observes the realization of a random variables Z n whose distribution 
depends on the current state Yr n of the factor process. The T n are modeled as 
jump times of a standard Poisson process with intensity A, independent of Y, so 
that the timing of the information arrival does not carry any useful information. The 
signal Z n takes values in some set Z C M K , and we assume that given Yp n = eft, 
the distribution of Z n is absolutely continuous with Lebesguc-dcnsity fk{z). We 
identify the marked point process I = (T n , Z n ) with the associated counting measure 
denoted by I(dt, dz). Note that the G-compensator of I is Xdt X)ft=i ^-Y t =e k fk(z)dz. 

Summarizing, the information available to the investor is given by the investor 
filtration V with 

Ft = Qf V Q{ , 0<t< T. (2.2) 



Portfolio and optimization problem. We describe the selffinancing trading of 
an investor by the initial capital xq > and the n-dimensional F-adapted trading 
strategy h where h\, i = 1, . . . ,n, represents the proportion of wealth invested in 
stock i at time t. It is well-known that in this setting the wealth process has 
the dynamics 

jyC 1 ) n JC! 

-^ W =Y,^=hJ i x{Y t )dt + h T t adW t , X { h) =x . (2.3) 

We assume that for all t £ [0,T] the strategy h t takes values in some convex and 
compact subset K of K n . The set K can be used to model constraints on the 
portfolio; for instance the choice 

n 

K = {h € M" : hi > c\ for some c\ < 0, hi < c-2 for some c-2, > 1} 

i=l 

would correspond to a limit \c\\ on the amount of shortselling and a limit C2 for 
leverage. Moreover, the assumption that ht € K for all t facilitates many technical 
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estimates in the paper. We denote by 

H = {h = (/k)te[o,T] : h is F adapted and h t g K for all t} (2.4) 

the class of admissible trading strategies. Since fJ.(Y t ) is bounded and a is constant, 
equation (|2.3|) is well denned for all h G H. 

We assume that the investor wants to maximize the expected utility of terminal 
wealth for power utility U(x) = ?C, 6 < 1, 9 =/= 0@ The optimization problem thus 
reads as 

max{£:([/(X^ ) )) :herl}. (2.5) 

This is a maximization problem under partial information since we have required 
that the strategy h is adapted to the investor filtration F. 

Partial information and filtering Next we explain how the control problem 
(|2.5p can be reduced to a control problem with complete information via filtering 
arguments. We use the following notation: for a generic process H we denote by 
H t = E(H\Ft) its optional projection on the filtration F, and the filter for the 
Markov chain Y t is denoted by p t = (pi , • • • ,pf) with p\ = P(Y t = e^l-Ft), k = 
1 . . . ,d. Note that for a process of the form H t = h(Y t ) the optional projection is 
given by h(Y t ) = J2k=i h( e k)Pt- I n particular, the projection of of the drift equals 

d 

/i(F t )=^/i(e fc )p t fc = Mpt. 
fc=i 

The following two processes will drive the dynamics of pt- First, let 

W t := a-^Rt - [ Mp s ds). 
Jo 

By standard results from filtering theory W is an F-Brownian motion (the so-called 
innovations process). Second, define the predictable random measure 

d 

ui(dt,dz) = XdtJ2Pt-fk(z)dz. 
fe=i 

By standard results on point processes vi is the F-compensator of /, see for in- 
stance Bremaud (1981). The compensated random measure will be denoted by 
I(dt, dz) := I(dt, dz) - u r (dt, dz). 

Using a combination of the HMM filter (sec e.g. Wonham (1965), Elliott et al. 
(1994), Liptser & Shiryaev (2001)) and Bayesian updating, in Frey et al. (2012) the 
following <i-dimensional SDE system for the dynamics of the filter p is derived 

dp t = Q T Ptdt + f3 T ( Pt )dW t + J 1I (p t -,z)I(dt, dz) (2.6) 

a The case 8 = corresponds to logarithmic utility U(x) = In a: which is treated in Frey et al. 
(2012). 
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with initial condition p§ = p k . Here, the matrix (3 = /3(p) = . . . , (3d) S M. nxd 
and the vector 7/ = yi(p, z) = (7), . . . , 7j ) T e R d are defined by 

and 7/ *(p,z)=/Y = ^--lY l<fc<d, with ^// t (z). 

(2.7) 

It is well-known (see e.g. Lakner (1998), Sass and Haussmann (2004)) that the 
F-semimartingalc decomposition of X is given by 



dX. 



(/<) 



hj Mp t dt + hJadWt. (2.8) 



Now note that for a constant strategy h t = h G K the (d + l)-dimcnsional process 
(X( h \p) is an F-Markov process as is immediate from the dynamics in (12.61) and 
(12. 8[) . Hence the optimization problem (|2.5[) can be considered as a control problem 
under complete information with the (d + l)-dimcnsional state variable process 
(X( h \p). This control problem is studied in the remainder of the paper. 



3. Dynamic Programming Equation for the Case of Power Utility 

A simplified optimization problem. As a first step, we simplify the control 
problem by a change of measure. As shown in Nagai & Runggaldier (2008) this 
measure change leads to a new problem where the set of state variables is reduced 
to p and where the dynamic programming equation takes on a simpler form. First we 
compute for an admissible strategy h <E% the utility of terminal wealth U(X^) = 
\{X [ t ] ) . From flU} it follows that 

\ (4' 1) ) 9 = y cxp {9 £ (hj MyT) - 1 \a T h s \ 2 )ds + e £ h T s adW s }, (3.1) 

where |.| denotes the Euclidean norm. Define now the random variable = 
exp { f Q T 9hJ adW s — | J Q T |#<7 T /i s | 2 ds} and the function 

b(p, h; 9) = -0(h T Mp - \a T h\ 2 ) . (3.2) 

Recall that n(Y s ) = Ylk=i /^-Ps = Mp s - Hence (|3.1|l can be written in the form 

J(4 ft V = y 4 fc) exp { jT -b(p s , h s - 6)ds) . (3.3) 

Since a is deterministic and h is bounded, the Novikov condition implies that 
E(L { t ] ) = 1. Hence we can define an equivalent measure P h on Tt by dP h / dP = 
L)p, and Girsanov's theorem guarantees that B t :— Wt — 9 / * a T h s ds is a standard 
F-Brownian motion. Substituting into (|2.6j) we find the following dynamics for the 
filter under P h 

dp t = a(p t , h t )dt + (3 T (p t )dB t + J ji(p t -, z)T(dt, dz) (3.4) 
where a = a{p, h) = Q T p + 9(3 l (p)a T h. (3.5) 
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In view of these transformations, for < 6 < 1 the optimization problem (|2.5[) is 
equivalent to 

max{^(exp{y -b{pf*' h \ h s ; 0)<te}j : h g (3.6) 

where we denote by ps P ' h ^ the solution of Q3.4p for s £ [t, T] starting at time 
t € [0,T] with initial value p € 5 for strategy h £ TL. For < on the other hand 
(|2.5[) is equivalent to minimizing the expectation in (|3 .€>[) . In the sequel we will 
concentrate on the case < 8 < 1; the necessary changes for 8 < will be indicated 
where appropriate. Moreover, 8 will be largely removed from the notation. The 
reward and value function for this control problem are given by 

UT 
-b{p^' p ' h \h s )ds^ toih&H, 

(3 7) 

V(t,p) = sup{v(t,p,h):h€H}. y ' J 
Note that v(T,p,h) = V(T,p) = 1. 

The dynamic programming equation. Next, we derive the form of the dynamic 
programming equation for V(t,p). We begin with the generator of the state process 
Pt (the solution of the SDE (|3.4[1 ) for a constant strategy h t = h. Denote by S = 
{p e R d : J2i=iP l = l iP l > M = 1= the unit simplex in Standard 

arguments show that the solution of this SDE is a Markov process whose generator 
C h operates on g e C 2 (S) as follows 



i,j=l i=l 

+ A / + 7/(^*0) ~ g(p)}7(z,p)dz. 

By standard arguments the dynamic programming equation associated to this 
optimization problem is 



Vi 



t {t,p) + suv{c h V(t,p)-b(p 1 h-8)V{t 1 p)X = 0, (t,p)e[Q,T)xS, (3.9) 
heK <• J 

with terminal condition V^(T,p) = 1. In case that 8 < the equation is similar, but 
the sup is replaced by an inf. Plugging in C h as given in (|3.8I) and b(p, h) as given in 
(|3.2[) into (|3.9p the dynamic programming equation can be written more explicitly 
as 

^ d d d 

= V t (t,p) + - E /3j(Pt)A(p t )^v^P) + E{E^V}V(^P) 

fe,/=i fe=i ;=i 

+ A y {V(t,p + 7/ (p, z)) - V(t,p)}J(z,p)dz (3.10) 
d 

+ sup { Y,Pk(Pt)<T T ehV p *(t,p) + 8V(t,p)(h T M P - \a T h\ 2 (l-8))Y 

heK fc=i 

Suppose for the moment that a classical solution to (|3.10[) exists. In that case the 
optimal strategy is easily computed. The argument of the supremum in the last line 
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of (|3.10p is quadratic in h and strictly concave (as aa 1 is positive definite). Hence 
this function attains a unique maximum on the convex set K. If K is sufficiently 
large this maximum is attained at the solution h* of the following linear equation 
(the first-order condition for the unconstrained problem) 

d 

aY,Mp)V pk (t,p) + V(t,p)(Mp - aa T h{l - 0)) = 0. 

fc=i 

Since a is an invcrtible matrix h* is given by 

1 1 ^ 

h*=h*(t,p) = ^^(a^r^Mp+y^—^Y^Mp^^p)}. (3.11) 

So far, our derivation of the HJB equation and of the candidate optimal strategy 
h* in (|3 . 111) is purely heuristic. A possible approach to give a precise mathematical 
meaning to the equation is to use verification arguments: if we can find a classical 
solution V G C 1,2 ([0,T) x S) of equation (|3. 10[) with bounded derivatives, the func- 
tion h* introduced in (|3.11[) is bounded. Hence there exists a solution p* of the SDE 
(|3.4[) with h t = h*(t,Pt). Then a standard verification result such as Theorem 3.1 
of Fleming & Soner (2006) immediately gives that V is the value function of the 
control problem (|3.6[) and that h* t := h*(t,p*) is the optimal strategy. 

However, the existence of a classical solution of equation (|3.10[) is an open issue. 
The main problem is the fact that one cannot guarantee that the equation is uni- 
formly elliptic. To see this note that the coefficient matrix of the second derivatives 
in (|3.10p is given by C(p) — (3 T (p)/3(p). By definition equation (|3.10|) is uniformly 
elliptic if the matrix C(p) is strictly positive definite uniformly in p. A necessary 
condition for this is that there are no non-trivial solutions of the linear equation 
fix — so that we need to have the inequality n > d (at least as many assets 
as states of the Markov chain Y). Such an assumption is hard to justify econom- 
ically; imposing it nonetheless out of mathematical necessity would severely limit 
the applicability of our approach. 

In the present paper we therefore study an alternative route to justifying equa- 
tions (|3.10j) and ()3.11|) . In Section [6] we use regularization arguments to show that 
(|3.11j) can be used to compute an approximately optimal strategy. More precisely, 
we add a term -4=dB t , with to G N and B a Brownian motion of suitable dimension 
and independent of B, to the dynamics of the state equation (|3.4j) . The HJB equa- 
tion associated with these regularized dynamics has an additional term ^-AV, A 
the Laplace operator, and is therefore uniformly elliptic. 

Hence the results of Davis & Lleo (2011) apply directly to the modified equation, 
yielding the existence of a classical solution V m . Moreover, the optimal strategy 
rn h* of the regularized problem is given by (|3.11j) with V m instead of V. We then 
derive convergence results for the reward- and the value function of the regularized 
problem as to — > oo. In particular, we show in Theorem 16.51 that for m sufficiently 
large m h* is approximately optimal in the original problem. 

An alternative justification for the dynamic programming equation (|3.10p can 
be given using the theory of viscosity solutions: using our estimates in Section [S] 
and results from Pham (1998) it is possible to show that V is a viscosity solution 
of (|3.10p . Note however, that this approach does not immediately provide a recipe 
for the computation of a (nearly) optimal strategy. 
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4. Reformulation of the State Equation 

To carry out the program described above we have to reformulate the state equation 
for a number of reasons. First, in our model the state variable p takes values in the 
simplex S which is a subset of a d — 1-dimcnsional hypcrplane of M. d . If wc introduce 
the announced regularization to the diffusion part of the state equation then the 
state variable will leave this hypcrplane and takes values in the whole K d so that the 
normalization property of p is violated which creates technical difficulties. Second, 
in our analysis we need to apply results from the literature on the theory of dynamic 
programming of controlled jump diffusions, such as Pham (1998) and Davis & Lleo 
(2011). These papers consider models where the jump part of the state variable is 
driven by an exogenous Poisson random measure. In our model, on the other hand, 
the measure / is not an exogenous Poisson random measure since the law of the 
compensator vj depends on the solution p t . 

Restriction to a d — 1-dimensional state. We rewrite the state equation 
in terms of the 'restricted' (d — l)-dimensional process ir = (ir 1 , . . . , 7r d_1 ) T = 
(p 1 , . . . ,p d_1 ) T . Then the original state p can be recovered from ir by using the 
normalization property for the last component p d and wc define p = Rn := 
(•7Ti, . . . , nd-i, 1 — X)f=i 7r ■ Assuming p € S implies that the restricted state 
process takes values in 

d-l 

S = |tt e M d_1 : ^7T J < 1,7T J > 0, i = l,...,d-l\. 

i=l 

Now the state equation for tt G S_ associated to (|3.4p reads as 

dn = a{7tt,ht)dt + ^(Ttt)dB t + J /(tt*-, z)I{dt, dz) (4.1) 

where the coefficients are given by 

a(7r, h) = (a^Rn, h),..., a^^Rir, h)) T e E d_1 (4.2) 

£(tt) = 03i(#O,---,/3d-iW) elT^- 1 (4.3) 

7/tt, z) = (riiRn, z), . . . , ^{Rix, z)f e (4.4) 

It is straightforward to give an explicit expression for a, j3 and 7 , but such an 
expression is not needed in the sequel. The original state can be recovered from 7r 
by setting p = Rtt. 

Exogenous Poisson random measure. In the remainder of the paper we as- 
sume that the state process solves the following SDE 

h t )dt + P 1 (ir t )dB t + / j(jr t _,u)N(dt,du), (4.5) 
Ju 

where a and /? are defined above, 7 : S x U — > and N is the compensated 

measure to some finite activity Poisson random measure N with jumps in a set 
U C M. K . The compensator of N is denoted by i/(du)Xdt, i.e. we have N(dt,du) = 
N(dt,du) — v(dz)\dt. In the next section we show that for a proper choice of 7(-) 
and N(dt, du) the solution of (|4.5[) has a same law as the original state process from 

USD. 
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In order to ensure that SDE (|4.5|) has for each control h € % a unique strong 
solution and for the proof of some of the estimates in Section [6] the coefficients a, f3 
and 7 have to satisfy certain Lipschitz and growth conditions (see Jacod & Shiryaev 
(2003) and Pham (1998)). These conditions are given below. For technical reasons 
we require that the conditions hold not only for n G S_ but also for a slightly larger 
set .Sg D S_ defined for sufficiently small e > by 

S, := {tt G K^ 1 : dist (tt,5) < e}, 

where we denoted the distance of it G R d_1 to S_ by dist (tt,<S) := inf{|7r— 7To|oo : 7i"o G 
S}, for |7r|oo the maximum norm on R d_1 . 

Assumption 4.1 (Lipschitz and growth conditions). There exist constants 
Cl,£ > and a function p :U — > M+ wz£/i J u p 2 (u)u(du) < +oo smc/i i/jai /or aZ/ 
7Ti , 7T2 € 6> e , £ < £ and k 



1, 



sup | a (tti , ft) - a (7T 2 , ft) | 



2* fa) 
a(7r, /i)| 



< C*L kl - 7T 2 | 



^(tt) <C l (1 + H), 

|7(7Tl,«) — 7(7T2,u)| < jO(lt) (TTI - 7T 2 | 

|7(7r, u) I < p(u)(l + \ir\). 



(4.6) 

(4.7) 

(4.8) 
(4.9) 



In our case the coefficients a and (3 are continuously differentiable functions of 
tt on the compact set S_ E and h G A' is bounded. Hence, the Lipschitz and growth 
condition (|4.6|) and (|4.7p arc fulfilled. Specific conditions on the densities /&(•) that 
guarantee (|4.8[) and (|4.9[) are given in the next section. 



5. State Equation with Exogeneous Poisson Random Measure 

In this section we show how a solution of the state equation (|4.ip can be constructed 
by means of an SDE of the form (|4.5[) that is driven by an exogenous Poisson random 
measure. The main tool for constructing 7 will be the so-called inverse Rosenblatt 
or distributional transform, see Riischendorf (2009), which is an extension of the 
quantile transformation to the multivariate case. 

We impose the following regularity conditions on the functions /&(•) that repre- 
sent the conditional densities of Z n given Yt„ = &h- 

Assumption 5.1. All densities fk{z), 1 < k < d, are continuously differentiable. 
and have the common support Z. We assume that Z is a n-dimensional rectangle 
[a, 6] C R K , i.e. 

Z = {z eM K : -00 < a k < z k < b k < 00, k = 1, . . . , k}. 

Moreover, there is some < C± such that fk(z) > C\ for all z G Z , k = 1, . . . , d 

Remark 5.2. Examples for densities that satisfy the above assumption are easily 
constructed. Start with C 1 -dcnsitics f k and choose some (large) rectangle Z then 

f k (z) = (l-e) l z ( z ) + £j ^l z (z) where \Z\ = J]^ - a,) (5.1) 

J z fk{u)du \*\ i=1 

k = 1, . . . , k, e G (0,1], satisfy the requirements of Assumption I5~T1 Intuitively, (|5.ip 
corresponds to a mixture of the original density f k and the uniform distribution. 
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The latter distribution carries no information so there is uniform a lower bound on 
the information carried by a single expert opinion. 

In order to write our state equation (|4.1[) in the form (|4.5I) with exogenous 
Poisson random measure we apply the inverse Rosenblatt transform, see for instance 
see Ruschcndorf (2009). Denote by U = [0, 1] K the unit cube in R K . In our context 
the inverse Rosenblatt transform is a mapping G :IA — > Z such that for a uniform 
random variable U on [0, 1] K the random variable Z = G(U) has density f(z,p) = 
^2j = xP* fj(z), P = Rk\ the mapping G can thus be viewed as a generalization of 
the well-known quantile transform. 

Now we explain the construction of the transformation G in detail. First, wc 
define for k = 1, . . . , re — 1, p = Rtt, the marginal densities 

fz 1 ...z k (zi,...,Zk,p) = / ■•■/ f{zi,---,Zk,s k+1 ,...,s K ,p)ds K ...ds k+ x, (5.2) 

For k — re we set fzi...z n '■— /■ Next we define for k = 2, . . . , re the conditional 
densities 

r / | N fz 1 ...Z k (Zl,...,Zk,P) 

fz k \z 1 ...z k _ 1 { z k\zi,---z k -i,p) := - j- -r 

JZ 1 ...Z k -i{Zl, ■ ■ .,Zk-l,P) 

and the associated distribution functions 

F Zk \z 1 ...z k _ 1 {zk\zi,...z k -i,p) = ( fz k \z 1 ...z h - l {sk\zi, ■ ■ ■ z k -x,p)ds k ; 

Ja k 

for k — 1 wc denote by Fz ± the distribution function of Z±. Now we introduce the 
Rosenblatt transform F : Z —> [0, 1] K = U by 

F(z,p) =F( Zl ,...,z K ,p) = (F 1 (z,p),...,F K (z lP )) T where (5.3) 
Ft(z,p) = F Zl (zi,p) and F k (z,p) = F Zk \z 1 ...z k ^ 1 {zk\zi, ■ . ■ z k -±,p), k = 2,...,K. 

Clearly, F k {z,p) depends on the first k variables z\, . . . , z k , only. Put for Z\, . . . , z k -i 
fixcnl 

H k {z) = F k (zi, z k -x,z,p) = F Zh \zi...z k _ x {z\zi, . . . z k -i,p) ■ 

Then Hk is a cumulative distribution function. Assumption 15.11 ensures that the 
joint density f(z.p) is finite and bounded away from zero. Hence, the conditional 
densities fz k \z 1 ...z k - 1 ( z l z i? • ■ • z k-i,p) are strictly positive, H k is strictly increasing 
and there exist an inverse function H^ 1 : [0, 1] — >• M. 

Now the desired transformation Z = G(U) = G(U,p) with the transforma- 
tion function G : U — > Z can be defined recursively by G(u) = G(u,p) = 
(G 1 (u,p), . . . ,G K (u,p)) T with 

Gi{u,p) = H^im) = F z ^{u u p) =: z 1 

G k {u,p) = H~ x {u k ) = F~^ Zi ^ Zki (u k \z 1 , . ..z k -i,p) -■ z k , k = 2, ...,re. (5.4) 

Note, that by construction it holds G(F(z,p),p) = z. From Riischendorf (2009) 
it is known that for U is uniformly distributed in [0, 1] K , the random vector Z = 
(Zi, . . . , Z K ) T = G(U,p) has the joint distribution density f(z,p). 

With the transformation G at hand we define the jump coefficient 7(77, u) by 

7*fr, U ) = W JA^p^p fOTU ^' k = l,...,d. (5.5) 
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Moreover, we choose the Poisson random measure N(dt, du) in t|4. 5|) such that the 
associated compound Poisson process has constant intensity A and jump heights 
which are uniformly distributed on U = [0, 1] K . Then, the compensator of N is 
v(du)dt = Xdudt and the compensated measure reads as N(dt,du) = N(dt,du) — 
Xdu dt. Note that with this definition the solution 7r t of (|4.5[) satisfies for some Borel 
set A c R^ 1 

P(An Tn G A | T Tn -) = / l A {l{^T n -,u))du 
Ju 

= J u lA {^( R7T T n -,G(u 7 Rir Tn _))yu 
= J z l A {li{R-KT n -,z)))J{z,R-K Tn -)dz. 

Hence with the above choice of 7 and N(dt, du), for constant h the process Rir, tt the 
solution of the SDE (|4.5j) , solves the martingale problem associated to the generator 
C h from dSU). Below we show that under Assumption 15. II the Lipschitz and growth 
conditions from Assumption 14. II hold, so that the SDE (|4.5j) has a unique solution. 
It is well-known that this implies that the martingale problem associated with C h 
has a unique solution, see for instance Jacod & Shiryaev (2003), Theorem III. 2. 26. 
Hence Rir has the same law as the state variable process p in (|3.4j) . which shows 
that we have achieved the desired reformulation of the dynamics of the problem in 
terms of an exogenous Poisson random measure. 

Verification of the Lipschitz and growth conditions. Next we show that 
under Assumption 15.11 the functions 7 fc (7r,«) satisfy the Lipschitz and growth con- 
ditions (|4.8|) and (|4.9[) . First note that the fact that all densities are C 1 with compact 
support Z = [a,b] implies the existence of constants C2,Cd < 00 such that for all 
1 < k < d, z G Z, 

f k (z) <C 2 and ^-fj{z) <C d , i = l,...,K. (5.6) 



Lemma 5.3. Under Assumption 15. Jl and for e < e := rajji^ the coefficient 
7(7r, u) defined in f)5 .5[) satisfies for tt G S_ e the Lipschitz and growth condition (|4.8[) 
and HO)l . 

Proof. We give the proof for the maximum norm | . 1^ in M. d . From this the assertion 
for the Euclidean norm can be deduced from the equivalence of norms. 

First we show that for n G S^, z G Z and e < e there are constants < C* < 
C* < 00 such that 

C,<J(z,Ric)<C*. (5.7) 
For this, observe that for p = Rtt, 

d 

j(z, P ) = £p»7 3 -(z) - 2 pv, w + E ( 5 - 8 ) 

J=l pJ<0 pJ>0 
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For the lower bound we deduce 

f(z,p) > V (-£)max/j(z) + V p> mmfj{z) 

3 3 
pi <0 >0 

>-e(d-l)C 2 + (l- -Ci > -e^ + l-Ci = C*, 

p->'<0 

where we used (|5.6p . p 5 ' > — £ and Y^j=iP'' = 1- For ^hc upper bound from 15.81 we 
find 

7(*,P) < + ]T p 7 max/^z) < (l - E P , ') Ca " (1 + " 1))C2 = °* 

p3>0 p3<0 

Note that the lower bound in (|5.7p implies that f(-,Rir) is strictly positive for 
7r G S_ e . Moreover, since the components of p = Rtt sum up to one by definition 
/(•,i?7r) is a strictly positive probability density for n £ S_ E . Hence, the inverse 
Rosenblatt transform G(u, Rtt) and thus the function 7(77, u) defined in (|5.5|) is well 
defined for tt £ (and not just for ir £ 5). 

Next we turn to the Lipschitz condition (|4.8|) , Clearly, (|4.8[) holds for some 
constant function p(it) = p if we can show that the derivatives of 7(77, u) with 
respect to ir J are bounded for all 1 < j < d — 1. This is obviously equivalent to 
estimating the derivatives of 

^ ; P V/(G(«,P),P) 
with respect to the components p J . Let 

C >^ : =7&(#^- 1 )' ^ = i>-^- 
J dp> \f(G(u,p),p) ' 

Then it holds 

The first term on the r.h.s. is bounded since it holds for k = 1, . . . , d and e < e 

fk(G{u,p)) < Q 



f(G(u,p),p) ~ G 

where we have used (|5.6[) and the lower bound for f(z,p) given in (|5.7[) . 

c k 
-3 



It remains to show that c!j(p,u) is bounded. Abbreviating z — z(p) = G(u,p) 



we find 



1 f ^ d d 



■AW-f/iW + EpT^wA^^p)) )• (5.io) 

»=i ;=i ' ^ 



Using (|5.6|) . Ylj=i b J l — 1 + (d ~ 1) £ an d estimate (|5.7[) for /, we derive for e < e 



3 

d 



"%P,u)\< ^(c^-^-G^p) 

1=1 



dp 3 



C* 



+C 2 - (c 2 + (l + (d-mC d J2\g^ Gl(u,P)\ ))■ (5-11 



2=1 



dpi 
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From the above inequality the boundedness of can be deduced from the bound- 
edness of the derivatives -^jGi{u,p). This is the assertion of the following lemma 
which is proven in the Appendix. 



Lemma 5.4. Under the assumptions of Lemma \5.3\ there exists a constant C > 
such that for j = 1, . . . , d and I = 1, . . . , k 

d 



< C. 



Proof of the growth condition (14.91) : Here we apply estimate (|5.9p and find 
fj(G(u,p)) 



-f(G(u,p),p) 

and hence \ j(p, w)!^ < /?(! + Moo) with some constant ~p. □ 



6. Regularized Dynamic Programming Equation 

In this section we introduce the regularized version of our dynamic programming 
problem and we discuss the convergence of reward and value function as the 
rcgularization-tcrms converge to zero. In Corollary 16.51 we finally show that optimal 
strategics in the regularized problem are nearly optimal in the original problem. 

Regularized state equation. Since regularization will drive the state process 
outside the set S we need to extend the definition of the coefficients a, (3 and 7 
from S to the whole W 1 ' 1 . For tt G K d ~\ h G K and e > we define 

a(n, h)(l — dist (ir,S_)/e) for n G 5 e 
otherwise. 



a(jr, h) 



Note, that S_ C S_ e and there is a continuous transition to zero if dist (jr, S_) reaches e. 
Moreover, on S_ it holds 5(7T, ft.) = a(w, h), i.e. the coefficients coincide. Analogously 
we define /3 and 7 as extensions of /3 and 7. 



Lemma 6.1. Under the assumptions of Lemma \4-l\ the coefficients a, (3 and 7 
satisfy the Lipschitz and growth conditions (|4.6|) to (|4.9|) for ir G R d_1 . 

Proof. The Lipschitz and growth conditions for the coefficients a, f3 and 7 given 
in Lemma 14.11 hold for ir G 5^ for £ < e. Multiplication of these functions by the 
bounded and Lipschitz continuous function I — dist (tt,S)/s preserves the Lipschitz 
and growth property. □ 

For the sake of simplicity of notation in the sequel we will suppress the tilde and 
simply write a, f3 and 7 instead of S, ft and 7. 

Next we define the dynamics of the regularized state process m 7r t 

d m Tr t = a( m 7r t , h t )dt + (3 T ( m Tr t )dB t + [ j( m Tr t -, u)N(dt, du) + -$=dB t (6.1) 
_ Jz~ V m 



where B t denotes a d — I-dimensional Brownian motion independent of B t . This 
state process is now driven by an n + d — f-dimcnsional Brownion motion. Note 
that the diffusion coefficient of the regularized equation (/3 T (7r t ), -^=I d _iY satisfies 
the Lipschitz and growth condition (|4.6|) and (|4.7p given in Lemma I4TT1 since ftfa) 
satisfies these conditions and ^=/d-i does not depend on p. 



13 



.^-Convergence m 7Tt —> TVt- We now compare the solution m 7r t of the regular- 
ized state equation (|6.ip with the solution n t of the unregularized state equation 
(|4.5[) and study asymptotic properties for m oo. This will be crucial for estab- 
lishing convergence of the associated reward function of the regularized problem to 
the original optimization problem. 

We assume that both processes start at time to £ [0, T] with the same initial 
value q £ S_, i.e. ™7r to = 7r fo = q. The corresponding solutions are denoted by 



m (t ,q,h) 



m_(*o,9,'0 



Lemma 6.2 (Uniform i^-convergence w.r.t. h £ H). It holds for m — >• oo 

2x 



E[ sup 

■ t <t<T 



m (t ,q,h) _ (t ,q,h) 



uniformly for h £ H. 



Proof. To simplify the notation we suppress the superscript (to, q, h) and write 7r t 
and m 7r t . Moreover, we denote by C a generic constant. 

We give the proof for to = 0, only. Using the corresponding representation as 
stochastic integrals for the solutions of the above SDEs we find 

m ir t - 7T t = Af + M 



where A r t 



t 



(a( m 7r s , h s ) - a(n s , h s ))ds and 



MT 1 = 



(P( m n s )- p(n s )) T dB s + 
dB t . 



( 1 ( m 7T s ,u)- 1 (TT s ,u))N(d S ,du) 



JU 



Denoting Gf := sup s<t | m 7r s - tt s | 2 ) it holds 



GT = E( sup \A? + M™\ 2 ) < 2E( sup |A 



2I?( S up|M s "f 



(6.2) 



For the first term on the r.h.s. we find by applying Cauchy-Schwarz inequality and 
the Lipschitz condition (|4.6I) for a 

2 

sup\A r s 

s<t 



sup 

s<t 



< sup s 



a( rn ir u , h u ) - a(ir u , h u ))du 

2 

a( m 7T„, /i tt ) - a(7r u , h u )) du 
<t- [ C L \ m TT u -TT u \ 2 du<t - [ C L sup\ m ir v -ir v \ 2 du. 

JO JO v<u 

Note that the constant Cl does not depend on h. Taking expectation it follows 

E(sup\A[ n \ 2 ) <t-C L f E(sup\ m Tr v -Tr v \ 2 ds) <C f G™ds. (6.3) 

\ s<t ' Jo ^ v<s ' Jo 

For the second term on the r.h.s. of (|6.2p Doob's inequality for martingales yields 
E(sup|Mf| 2 ) < mWr\ 2 ) = 4^J^ E{tT[(^s)-K7T s )) T (P( m n s )-^7T s ))})d S 



Ju 



E{ |7( m 7r s , u) - 7(tt s , it)) | 2 )v{du)ds + 



(d- l)i 



»7 



(6.4) 
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Using the Lipschitz conditions f|4.6[) and (|4.8[) for the coefficients /3 and 7 it follows 

i?(tr[(£("V s ) -^ s )) T (^r7T s ) -£(*.))]) < Ci^r^-TT,! 2 ) 

< C 2 E( sup | m 7r„-7r„ | 2 ) =C 2 L G? 

V<8 

E( \2( m n s ,u) - 7(tt S) u))\ 2 ) < p 2 {u)E{\ m K s - tt s | 2 ) 

<p 2 {u)E{^\ m n v --K v \ 2 ) =p\u)G™ 

Substituting the above estimates into (|6.4[) it follows that 



£(sup|M s m | 2 ) <A( [ C 2 L G™ds + f G™ds f p 2 (u)v{du 

^ s<t I \ Jo Jo Ju 



(d-l)t 



<cfG™ds+ ^ d l) \ (6.5) 
Jo 171 

Substituting ([573]) and (|6~5|) into (|672|) we find 

GI™ < ^ T + C f GTds. 

m Jo 

Finally we apply Gronwall Lemma to derive 

4(d-l)T e cT^ Q fo,. go 
m 

which concludes the proof. □ 



Note, that the -^-convergence for the restricted state process m TTt established 
in Lemma 16.21 also holds for the associated d-dimcnsional process m p t = R m ir t . 

We now extend the notions of reward and value function given in ()3.7[) to the 
process m p t = R m n t with m 7r t satisfying the regularized state equation (|6.1[) . Since 
m p t takes values in R d and not only in S we extend the function b given in (|3.2p 
with (p,h) G S x K i-> b(p,h) G R to p = Rn G K d . With the notation b* = 
min{fe(p, h), p E S,h E K} and b* = max{6(p, h), p G S,h G K} we define 

b(p,h) := (b(p, h) V &*) A b* . 

Then b is bounded on R d x K and for p G S the function & coincides with b. For 
the sake of simple notation in the sequel we simply write b instead of b. We define 
the reward and value function associated to the regularized state equation (|6.f p by 

UT 
-&(i?( Tn 7if '^J), h s )ds^ for h G W, 

V m (t,7r) = sup{v m (t,n,h): h G %}. 



The reward and value function associated to the unregularized state equation (|4.5[) 
are denoted by v(t,TT,h) and V(t,ir). The generator associated to the solution of 
the regularized state equation ()6.1[) reads as 



d-1 d-1 d-1 

i,j=l 1 i—1 

+ / {9{^ + l{^,u)) - g(n)}v(du) 



u 
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and the associated dynamic programming equation is 

V t m (t,Ti) + sup \ m £ h V m (t,TT) - b(Rn,h;8)V m (t,n)} = 0, (t, tt) g [0,T) x M^ 1 . 

Note, that for the generator m £ h the ellipticity condition for the coefficients of 
the second derivatives holds: we have for all z G R d_1 \ {0} 

z T (f3 T f3+^-I d _ 1 )z = z T l3 T f3z+^-z T z= \f3z\ 2 + ^-\z\ 2 > 0. 
2m 2m — 2m 

Hence the results of Davis & Lleo (2011) yield the existence of a classical solution 
V rn . The optimal strategy m h* of the regularized problem can be computed by a 
similar reasoning as Section [3] In particular, for K sufficiently large it follows that 

d-l 

(l-9) ( "" ' r""" ' V m (t,irf 



*h* = m h*(t,n) = ^^( (T(T T r 1 {A/i? 7 r+-^— ^fitW^tU)} (6-6) 



fc=i 



where f3 k (ir) = Tt k a 1 [fJ- k — ^""^ ^j^j an< ^ f 1 ^ : ~ — fJ-d, k = l, ...,d— 1. 

i=i 

Convergence of reward and value function. The next theorem on the uniform 
convergence of reward functions is our main result; convergence of the value function 
and e-optimality of follow easily from this theorem. 

Theorem 6.3 (Uniform Convergence of reward functions). It holds 

sup \v m (t, it, h)-v(t, it, h)\^0 /or TTWoo, t G [0, T], tt G «S. 

/!.£"« 

Proof. We introduce the notation 

J:=y -b(RTT { s t ^^ h \h s )ds and J m := j -6(ii TB 7r< t ' ,r ' h >, fc,)ds. 

Then the reward functions read as v(t, tt, h) = E(e J ) and v m (t, tt, h) = E{e jm ) and 
it holds 

\v m {t,TT,h) - v(t,TT,h)\ = \E(e jm -e J )\ < E(\e jm - e J |) 

< C£(| J m - J\), (6.7) 

where we used Lipschitz continuity of f(x) = e x on bounded intervals and the 
boundedness of J and J m which follows from the the boundedness of b. Using 
Lipschitz continuity of b we derive 

E(\ J m - J\) = e( J [b{RTT^ n ' h \h s )-b{R m TTi t ^^,h s )]ds 

< J CE{\ m TT^^ - TT^^\)d S 

< C J (e^k^M - 7Ti t,w ' h) \ 2 ) V2 ds -> (6.8) 

for m — >• oo and uniformly w.r.t. h £ H which follows from Lemma 16.21 Plugging 
(HEU) into dUTJ) we find 



sup \v m (t, n, h) — v(t, TT,h)\ — > for m — > oo. 
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□ 



Corollary 6.4 (Convergence of value functions). It holds 

V m {t,Ti) -> V(t,ir) /orm-)oo, t G [0,T], tt G 5. 



Proof. For G (0, 1) the assertion follows from 



\V m (t,n)-V(t,n)\ 



= SUP V m (t, TT, ft,) — SUp v(t, TT, ft) 

?igw hen 



< SUp |w m (t,7T, ft) - v(t,7T,h)\ 



and Lemma 15751 Analogously, for 9 < it follows 



|F m (t,7r)-U(i,7r)| 



= inf v m (t,TT,h) — inf v(t,w,h) 




= SUp( — U m (t,7T, ft)) — SUp( — 7T, ft)) 
/iG« /iGH 



< sup |w m (t,7r, ft) - v(t,ir,h)\. 



□ 



On e-optimal stratgies. Finally we show that the optimal strategy m h* for the 
regularized problem given in (|6.6p is £-optimal in the original problem. This gives 
a method for computing (nearly) optimal strategies. 

Corollary 6.5 (e-optimality). For every e > there exists some too G N such 
that 

\V(t,n) - v(t,TT, m h*)\ < e form>m , 
i.e. m h* is an e-optimal strategy for the original control problem. 

Proof. It holds 

|V(t,7r) - v(t,7r," l ft*)| < |V(t,7r) - v m {t,ir, m h*)\ + \v m (t, tt, m h*) - v(t, tt, m h*)\ 
= \V(t, n) - V m (t, tt)| + \v m (t, tt, m h*) - v(t, tt, m h*)\ (6.9) 

where for the first term on the r.h.s. we used v m (t,x, m h*) — V m (t,Tr). Using the 
convergence properties for the reward function given in Lcmma l6.3l and for the value 
function given in Corollary 16.41 we can find for every e > some mo G N such that 
for to > Too it holds 



Remark 6.6. Note that in the proof of the corollary we use that the sequence of 
reward functions v m converges to v uniformly in ft. This is a stronger property than 
convergence of the value functions V m to V so that standard stability results for 
dynamic programming equations are not sufficient to proof the corollary. 



\V(t,TT)-V m (t,Tr)\< | and \v m (t,Trrh*)-v(t,TT, m h*)\ < |. 



Plugging the above estimates into (|6.9[) it follows for to > 7710 



\V{t,TT)- V {t,TT, m h*)\< £ -+ £ - 



= e. 



□ 
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Appendix A. Proof of Lemma 15.41 

We derive from differentiating Gi(F(z,p),p) = z; w.r.t. pj using the chain rule 

^ d d d 

J2 ^-Gi{F{z ilP ),p)-^Fi{z,p) + ^-jGi(F(z,p),p) = 0. 
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Substituting u = F(z,p) we obtain the estimate 







(Appendix A.l) 



(i) For the proof of the boundedness of the derivatives on the r.h.s. we need the 
following auxiliary estimates for the marginal densities fz x ...z k , k = 1, . . . , K given 
in (|5.2p . From estimate (|5 . 7[) for / we derive the estimate 



C* ' [ (bi-ai) < f Zl ...z k (zi,...,z k ,p) < C* Y[ (h(A m )mdix A.2) 

i=fe+l i=k+l 

For the derivatives of the marginal densities w.r.t. p? the definition of / in (|2.7j) 
yields 

Q r b k+i rb K 

-^—fz 1 ...z k (zi,...,z k ,p)= / .../ fj(zi,...,z k ,Sk+i,...,s K )ds k +i...ds K . 

°P J Ja k+1 Ja K 

From Assumption 15.11 and (|5.6[) it follows 

0<Ci f[ {b i ~a l )<-^-f Zl ...z k (zi 

i=k+l ^ 

For the derivatives of the marginal densities w.r.t. Zj, j = 1 . . . , k we find 
d 



,z k ,p)<C 2 [ ((Appendix A.3) 

i=k+l 



-fz 1 ...z k (zi, . . .,Zk,p) 



< 



b K d 



d 



^2p 1 fi(zi, ■ ■ .,Zk,s k +i,.. .,s K ) 

a k+1 Ja K j =1 1 ° z j 



ds k+1 ...ds K 

(Appendix A.4) 



< c d J] (in - a, t ), 

i=k+l 

where the upper bound from (|5.6p on the derivatives of the densities fj has been 
used. 

(ii) Now we can prove the boundedness for the second term r.h.s. of ( |Appendix A.l ). 
For k = 2, . . . , k we obtain from the definition of F(z,p) in (|5.3[) 



_d_ 

dp] 



Fi(z,p) 



z k Q 
a k dp] 

Zk d f Zl ...z k (zi,...,z k -i,s k ,p) 



fz k \z 1 ...z k ^ 1 { s k\zi, ■ ■ ■ z k -t,p)ds k 

ds k 



a k dpi f Zl ...z k _ 1 {zi,----,z k -i 1 p) 



< 



1 



a k f. 



k J Z\...Z k 



J') 



-Q—fz 1 ...z k {-) fz 1 ...z k _ 1 {-) + 



d \ 
fzL.-Zki-) Q--fz 1 ...z k - 1 (-) jds k 



< C. 



Here, we have used estimate ( |Appcndix A.2| ), which states that the marginal den- 
sities are bounded from above and bounded away from zero, and ( |Appcndix A.3[ ) 
for the boundedness of the derivatives of the marginal densities w.r.t. pP . 
For k = 1 we observe that 



8 







—F 1 (z,p) = — 1 F Zl (zi,p)= / — f Zl (s 1 ,p)ds 1 . 



dp> 



dp> 







at dp 3 ' 
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The boundedness d/dp>Fi(z,p) is a consequence of estimate \ Appendix A.3[ ). 

(iii) For proving the boundedness of d/duiGi(u,p) in ( | Appendix A.l I we consider 
the Jacobian matrices for G(u) and F(z) defined by 



J G {u) := (^-Gi(u,p)) and J F (z) := (^-F^p)) 

\OUj /i.7=l...,K \OZ, I 



Below we show that for z = G(u,p) the matrix J F (z) is regular, hence J G (u) — 

J~ (G(u,p)), since G(F(z,p),p) = z. From the definition of F in (15.31) it follows 

.r 

that J F (z) is a lower triangular matrix since Fk depends on z\, . . . , Zk only. 

Next we consider the diagonal elements of J F (z). Using ( |Appcndix A.2[ ) we find 
constants G_ and C such that C < fz!...z k (zi, . . . , z k ,p) < C for all k = 1 . . . , n. 
Then it holds with S := win{C, C/C} 



d ~ 

—Fi(z,p) = f Zl (zi,p)>5 and 
d ~ 

—F k (z,p) = fz k \z 1 ...z k - 1 {zk\zi,.-.z k - 1 ,p) 



fzx...Z k (Zl,...,Z k) p) 



for k = 2, . . . , K. Since J F {z) is triangular, its determinant is 
det(J^z)) = f[ -^h{z,p) > S K > 0, 



>S, 



fc=i 



hence J F (z) is invertible. 

Next we show that the the non-zero off-diagonal elements of J F are bounded. 
,K, j = 1, ...,k-l 



It holds for k 
_d_ 

dzj 



2. 

Fk(z,p) 



-fz„\z 1 ...z k _ 1 {s k \zi, ■ ■ ■ z k -x,p)ds k 



hi. 



d fzi...z k {zi,--.,z k - u s k ,p) 
dzj fzi...z k -Azi,... t Zk-i,p) 



dsi 



< 



d 



fk...z k _S-)\ 9 Zj 



!z x ...z k {-) /zi...Z fc _i(-) 

d 



\ 

+fzi...z k (-) -g^fz 1 ...z k _ 1 {-) Jds k 



ds k < C. 



Here again we have used that the marginal densities are bounded from above and 
bounded away from zero, and ( |Appendix A. 4 ) for the boundedness of the derivatives 
of the marginal densities w.r.t. Zj. 

For proving the boundedness of d/duiGi(u,p) in ( | Appendix A.l ) which are the 
entries of the Jacobian matrix J G (u) we use that J G is the inverse of J F . Since J F 
ist a triangular matrix the entries of J can be computed recursively by Gaussian 
elimination starting with the first row. This gives that for k, I = 1, . . . , K 

h— 1 

J kl = TF (*« - J2 J kJ J fl) ' 
J kk j=l 

i.e. the entry J G t can be represented by an affinc linear combination of the bounded 
off-diagonal entries in row k of J F divided by J F k . The latter is strictly positive 
and bounded from below by S > 0. Hence, all entries of J G are bounded. 



20 



